Analysis of the correlation structure for a neural predictive model with application to speech recognition

نویسندگان

  • Li Deng
  • Khaled Hassanein
  • Mohamed I. Elmasry
چکیده

-A speech recogmzer ts developed usmg a layered feedforward neural network to implement speech-frame predwtlon. A Markov cham ts used to control changes in the network's wetght parameters. We postulate that speech recogmtion accuracy ts closely hnked to the capabthty of the predictive model m representing long-term temporal correlattons in speech data. Analyttcal expresstons are obtamed for the correlatton functions for various types of predwttve models (hnear, compressively nonhnear, and )omtly hnear and compresstvely nonhnear) to determme the fatthfulness of the models to the actual speech data Analyttcal results, computer simulattons, and speech recognttton experiments suggest that when compresstve nonhnear predictton and hnear prediction are jomtly performed withm the same layer of the neural network, the model ts better at capturing long-term data correlattons and consequently lmprovmg speech recogmtton performance Keywords--Temporal correlations, Joint linear/nonlinear prediction, Multllayer perceptron, HMM.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Correlation between Auditory Spectral Resolution and Speech Perception in Children with Cochlear Implants

Background: Variability in speech performance is a major concern for children with cochlear implants (CIs). Spectral resolution is an important acoustic component in speech perception. Considerable variability and limitations of spectral resolution in children with CIs may lead to individual differences in speech performance. The aim of this study was to assess the correlation between auditory ...

متن کامل

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

طراحی یک مدل مبتنی بر شبکه‌های عصبی برای شناسایی و تجزیه و تحلیل الگوهای غیرطبیعی در نمودارهای کنترل فرآیند

Neural networks because of their abilities are used to patterns recognition. In statistical process control charts, a common cause variation distort expected form of unnatural patterns and so detection of assignable causes efficiently and precisely in a real-time is difficult. Therefore it would be logical to propose models based neural networks for recognition and analysis of patterns in proce...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Neural Networks

دوره 7  شماره 

صفحات  -

تاریخ انتشار 1994